A Preliminary Study on Methods for Retaining Data Quality Problems in Automatically Generated Test Data
Authors
Abstract
Organisational data often contains business secrets that the organisation does not want to release. However, there are occasions when an organisation must release its data, such as when outsourcing work or using the cloud for Data Quality (DQ) tasks like data cleansing. Currently, there is no mechanism that allows organisations to release their data for DQ tasks while ensuring that business secrets are suitably protected. The aim of this paper is therefore to present our current progress on determining which methods can modify secret data while retaining its DQ problems. So far, we have identified ways in which the data swapping and SHA-2 hash function alteration methods can be used to preserve three kinds of DQ problem (missing data, incorrectly formatted values, and domain violations) while minimising the risk of disclosing secrets.
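As a rough illustration of the two alteration methods named in the abstract, the following Python sketch shows one way they might be applied to a single column. It is not the authors' procedure: the function names `swap_column` and `hash_value`, the sample values, and the choice of SHA-256 as the SHA-2 variant are all assumptions made for this example.

```python
import hashlib
import random


def swap_column(values, seed=0):
    """Data swapping (illustrative): permute one column's values across records.

    Every original entry, including None and malformed entries, still
    appears exactly once, so column-level DQ problems survive, while the
    link between a value and its original record is broken.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    shuffled = list(values)
    rng.shuffle(shuffled)
    return shuffled


def hash_value(value):
    """Replace a value with its SHA-256 digest, leaving missing values missing."""
    if value is None or value == "":
        return value  # keep the missing-data problem visible
    return hashlib.sha256(str(value).encode("utf-8")).hexdigest()


# Hypothetical column with a missing value, a badly formatted value ("abc"),
# and a domain violation (212 as an age).
ages = [34, None, "abc", 212, 45]
swapped = swap_column(ages)

# Hypothetical name column with two kinds of missing entry.
names = ["Alice", "", None, "Bob"]
hashed = [hash_value(n) for n in names]
```

In this sketch, swapping keeps all three kinds of DQ problem in the column, whereas plain hashing keeps only the missing values; how the hash-based method retains format or domain problems is not stated in the abstract.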
Similar resources
An Adaptive Approach to Increase Accuracy of Forward Algorithm for Solving Evaluation Problems on Unstable Statistical Data Set
Nowadays, hidden Markov models are extensively used for modeling stochastic processes. These models help researchers establish and implement the desired theoretical foundations using Markov algorithms such as the Forward algorithm. However, using the stability hypothesis and the mean statistic to determine the values of Markov functions on unstable statistical data sets has led to a significant reducti...
Comparing two teaching strategies Lecture and PBL, on learning and retaining in nursing students
Introduction. Health care services require nurses who are critical thinkers with strong learning skills and who can solve problems for which there are often no standard solutions. Therefore, a change in the way teachers teach merits consideration. Frost argued that PBL equipped nurses with the skills demanded by society. Methods. With the above in mind, this research, which is a semi-experimenta...
Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study
This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...
Talent management in Handball: Identifying the factors of Engaging, Developing and Retaining talent in Handball of IRAN
The purpose of this study was to identify the factors of engaging, developing and retaining talent in handball in Iran, based on the grounded theory approach. This was exploratory research of a qualitative nature. The data were gathered through documents and through in-depth, semi-structured interviews with 15 handball experts. The sample was selected through subjective sampling a...
A mesh generation procedure to simulate bimaterials
It is difficult to develop an algorithm that can generate an appropriate mesh around the interfaces in bimaterials. In this study, such an algorithm is proposed for this class of unified structures made from different materials with arbitrary shapes. The non-uniform mesh is generated adaptively based on the advancing front technique available in the Abaqus software. Implementing severa...
Journal title:
Volume, issue:
Pages: -
Publication date: 2012